We study the cooperative asynchronous multi-agent multi-armed bandits problem, where each agent's active (arm-pulling) decision rounds are asynchronous. That is, in each round only a subset of agents is active to pull arms, and this subset is unknown and time-varying. We consider two models of multi-agent cooperation, fully distributed and leader-coordinated, and propose algorithms for both models that attain near-optimal regret and communication bounds, both almost as good as their synchronous counterparts. The fully distributed algorithm relies on a novel communication policy consisting of accuracy-adaptive and on-demand components, and on successive arm elimination for decision-making. In the leader-coordinated model, a single leader explores arms and recommends them to the other agents (followers) for exploitation. Because agents' active rounds are unknown, a competent leader must be chosen dynamically; we propose a variant of the Tsallis-INF algorithm with few switches to choose such a leader sequence. Lastly, we report numerical simulations comparing our new asynchronous algorithms with known baselines.
Free, publicly-accessible full text available March 6, 2026.
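The decision-making backbone of the fully distributed algorithm is successive arm elimination. As a point of reference, here is a minimal single-agent sketch of that backbone, not the paper's algorithm: it omits the accuracy-adaptive and on-demand inter-agent communication entirely, and the function name, the Hoeffding-style confidence radius, and the Bernoulli test arms are illustrative assumptions.

```python
import math
import random

def successive_elimination(arms, horizon, delta=0.05):
    """Successive arm elimination sketch: repeatedly pull all surviving
    arms, then drop any arm whose upper confidence bound falls below
    the best surviving arm's lower confidence bound."""
    active = list(range(len(arms)))   # indices of surviving arms
    means = [0.0] * len(arms)         # empirical mean rewards
    pulls = [0] * len(arms)           # pull counts
    t = 0
    while t < horizon and len(active) > 1:
        for a in list(active):
            reward = arms[a]()        # pull arm a, observe stochastic reward
            pulls[a] += 1
            means[a] += (reward - means[a]) / pulls[a]
            t += 1
        # Hoeffding-style confidence radius (one common choice).
        def rad(a):
            return math.sqrt(math.log(4 * len(arms) * t * t / delta) / (2 * pulls[a]))
        best_lcb = max(means[a] - rad(a) for a in active)
        active = [a for a in active if means[a] + rad(a) >= best_lcb]
    return max(active, key=lambda a: means[a])

# Example: three Bernoulli arms with means 0.3, 0.5, 0.7.
arms = [lambda p=p: 1.0 if random.random() < p else 0.0 for p in (0.3, 0.5, 0.7)]
print(successive_elimination(arms, horizon=30_000))
```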
In this paper, we study the multi-scale expert problem, where the rewards of different experts lie in different reward ranges. The performance of existing algorithms for this problem degrades linearly in the maximum reward range of any expert (or of the best expert) and fails to capture the non-uniform heterogeneity in reward ranges among experts. In this work, we propose learning algorithms that construct a hierarchical tree structure based on the heterogeneity of the experts' reward ranges and then determine differentiated learning rates based on the reward upper bounds and cumulative empirical feedback over time. We characterize the regret of the proposed algorithms as a function of the non-uniform reward ranges and show that they outperform prior algorithms when expert rewards exhibit non-uniform heterogeneity across ranges. Lastly, numerical experiments verify the efficiency of our algorithms compared to previous ones.
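A common building block behind differentiated learning rates is exponential weights with one rate per expert. The sketch below is an illustrative assumption, not the paper's method: it simply scales each expert's learning rate inversely with its reward range, and it omits the hierarchical tree construction and the adaptation to cumulative empirical feedback described in the abstract; `multiscale_hedge` and the example experts are hypothetical.

```python
import math
import random

def multiscale_hedge(reward_fns, ranges, horizon):
    """Exponential-weights sketch with one learning rate per expert.

    Each expert i gets a rate eta_i inversely proportional to its
    reward range, so large-range experts are updated conservatively
    while small-range experts adapt quickly.
    """
    n = len(ranges)
    etas = [math.sqrt(math.log(n) / horizon) / r for r in ranges]
    log_w = [0.0] * n                  # log-weights for numerical stability
    total_reward = 0.0
    for t in range(horizon):
        m = max(log_w)
        w = [math.exp(x - m) for x in log_w]
        z = sum(w)
        probs = [x / z for x in w]     # play an expert drawn from probs
        rewards = [f(t) for f in reward_fns]
        total_reward += sum(p * r for p, r in zip(probs, rewards))
        for i in range(n):
            log_w[i] += etas[i] * rewards[i]   # per-expert update
    return total_reward

# Example: two experts on very different reward scales.
experts = [lambda t: random.uniform(0, 1), lambda t: random.uniform(0, 100)]
print(multiscale_hedge(experts, ranges=[1.0, 100.0], horizon=10_000))
```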